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ABSTRACT 

According to classical test theory, the presence of 
random measurement error in a psychological test has important 
implications for validation studies. The more comprehensive 
application of classical test theory in construct validation is 
distinguished from that in criterion-oriented validation. Critics 
thematic apperceptive measurement of the achievement motive have 
often blurred this distinction and have consequently failed to 
appreciate the construct validity of this motive measure. Using 
explicit true score measurement models of theoretical constructs, 
evidence for the construct validity of thfe achievement motive in a 
.-^presentative sample of adult males in America (n=508) is described. 
Evidence is presented that the achievemesnt motive construct is 
related in theoretically expected ways to constructs of academic 
achievement and work satisfaction, desj^ite the presence of 
substantial random measurement error in thematic apperceptive 
measures of the motive. Evidence for the discriminant validity of 
story content from story length in the thematic apperception test 
also presented in this nomological network. (Author/PN) 
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Abstract 

According to clattlcal test theory, the presence of random measurement 
error in a psychological test has Important implications for validation 
studies. We distinguish the more comprehensive application of classical 
test theory In construct validation from that in criterion-oriented 
validation. Critics of thematic apperceptive measurement of the 
achievement motive have often blurred this distinction and have 
consequently failed to appreciate the construct validity of this motive 
measure. Using explicit true score measurement .models of theoretical 
constructs, we present evidence for the construct validity of the 
achievement mot.lve In a representative sample of adult males In America 
(N - 508). We present evidence that the achievement motive construct is 
related In theoretically expected ways to constructs of academic 
achievement and work satisfaction, despite the presence of substantial 
random measurement error in ' thematic apperceptive measures of the 
motive. Evidence for the discriminant validity of story content from 
story length In the thematic apperception test Is also presented In this 
nomo log leal network. 
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The thematic apperception teit (TAT) and the concept of need for 
achlavament (n AchleVament) have a ihared hlitory which datei back more 
than i*Q yeart^^CMurray , 1938) • Ai a theoretical conitructi the 
achlavamant motive rapraiantt a broad, affectively-toned dlipoiltlon to 
compata/wlth a itandard of axcelience. The motive construct )• expected 
to influence behavior In an extensive set of situations where 
Individuals experience pride In accomplishment and disappointment in 
failure (Atkinson, 1958; Atkinson and Feather, I966; Atkinson and 
Raynor, 197i*; McClelland, Atkinson, Clark, end Lowell, 1953; McClelland 
..and Winter, 19^9; Veroff and Feld, 197/0). 

As a technique for measuring the achievement motive, the TAT has 
generally shown modest test-rctest and Internal consistency reliability 
(Murstein, 1963), Critics have repeatedly contended that the low 
reliability of thematic apperceptive measures precludes the 
demonstration of validity for the achievement motive (Entwisle, 1972; 
FIneman, 1977; Kllnger, 1966). This argument Is based on a fundamental 
principle of classical test theory (Lord and Novick, I968) ; namely, that 
the validity of a test with respect to any cr I ter i on; cannot exceed the 
Index of reliability of the test (I.e., the square f/oot of reliability). 
In other words, high reliability is a necessary but not a sufficient 
condition for high criterion validity. Criterion validity Includes both 
predictive validity and concurrent validity, but differs In Important 
respects from"bx)nstruct validity (Cronbach and Meehl, 1955)* Successful 
criterion validation depends only on the degree of observed covariation 
between predictor variables and a criterion variable. 
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Some writers have argued that the achievement motive as assessed by 
the TAT lacks convergent validity (e.g., FIneman, 1977) and Interpret 
this failure In light of the unrel labll I ty of , the TAT. Similarly, 
writers have argued that the achievement motive as assessed by the TAT 
lacks discriminant validity. Entwisle (1972). for example, has argued 
that thematic apperceptive measures of the achievement motive are 
confounded with TAT story length, ^'conceived as an Indicator of verbal 
achievement and probably also of academic socialization" (p.367) • This 
claim Is based on the observation that TAT n Achievement and TAT story 
length covary positively, and observed zero-order correlations between 
TAT story length and grades in school are nominally larger than zero- 
order correlations between TAT n Achievement and grades In school. 
Entwisle suggests that evidence of the discriminant validity of TAT 
n Achievement wl th respect to TAT story length Is not forthcoming, 
because measurement error in thematic apperceptive measures of the 
achievement motive severely limits the capacity of the test to correlate 
wl th any cr I ter I on. 

These critics have, evaluated the convergent and discriminant 
validity of TAT n Achievement by examination of cr i ter i on va 1 I dl ty 
coefficients relating observed test scores. Such criteripn validity 
coefficients confound measurement error in the observed variables with 
the true relationships between theoretical constructs. We believe that 
these Issues of convergent and discriminant validity are inadequately 
addressed by examination of criterion validity coefficients alone and 
ought to be placed In the fnore comprehensive context of construct 
validation (Alwln* 197^).^ The search for convergence among tests 
presupposes that no single criterion Is fully adequate to define the 
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underlying dimention of interett; thit pretuppoti t i on It precltely what 
Cronbach and/Meehl {1955i p. 282) define as construct validation. 
SImiiarly, discriminant vai idi ty presupposes the logic of construct 
validation. In order to Justify why two observed tests ought to show 
discriminant validity, we require laws in.a nomological network which 
relate (a) the observed tests to other observed tests, (b) theoretical 
constructs to the observed tests, and (c) these different theoretical 
constructs to each other (Cronbach and Meehl , 1955» p.290) . Clearly, 
construct validation goes beyond cr 1 ter ion validation in requiring a 
theoretical context for evaluating validity. 

We* believe that critics of. TAT n Achievement' have selectively 
Invoked the logic of classical test^^the&ry by resti^JctJ^ their 
attention to cr I ter I on va 1 I dl ty coef f 1 dents . We apply -rtassicai test 
theory In the fuller context of construct validation by distinguishing 
between ''observed" (criterion) and "true" (construct) validity 
coefficients, the latter having been d I sattenuated for measurement 
error. We propose to examine evidence for the construct validity of TAT 
n Achievement In an investigation of the discriminant validity of the 
achievement motive and verbal achievement. We predicate our examination 
pf relationships Involving these theoretical constructs on Justifiable 
measurement models of the constructs. 

The achievement motive Is conceived as an affective disposition to 
find those situations attractive in which personal performance is 
aval ua ted aga! nst some* standard of excel lence . Because performance 
evaluation Is typically made salient In academic situations, researchers 
(e.g., Entwjsle, 1972) assume that, the achievement motive ought to 
covary posi tlvely with academic achievement, measured by grades or years 
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of schooling completed, for Instance. Beciuse schools Invariibly demand 
verbal achievement! it is reasonable to suppose that greater verbal 
achievement, measured by TAT story length, will also be associated with 
higher academic achievement. On the other hand, we expect that 

affective concern over doing well will be systematically related to 
behavior in^many^sTTuations' In which verbal skills are only incidentally 
required. We hypothesize that work settings fall Into this class of 
situations. Affective reactions to work ought to be related to the 
achievement motive, but not to verbal achievement. If the achievement 
motive and verbal achievement (both measured by the TAT) are found to be,, 
related to a relevant construct (e.g., work satisfaction) in different 
waySf they should be conceptualized as distinct constructs, even though 
they may covary positively with each other. This would represent 
evidence for the construct validity of TAT n Achievement (Cronbach and 
Meehl , 1955) • 

Selecting a measurement model for TAT n Achievement 

Classical true score theory partitions the variation in a measured 
variable (X) i nto, two or thogona I components — true score variance and 
random measurement error variance (Lord and Novick, I968) . According to 
the classical true score model, a true score is a hypothetical expected 
value of X over repeated measurements, and true score variance is 
r^lected in the covarlance among different attempts to measure the same 
underlying construct. The random error component of this model is 
defined as the difference between observed and true scores, and 
according to the model, random error' variance is reflected in the 
difference between the variances of true and observed scores. Random 

6 
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meaturement error It uncorrelated with true tcore variation and with 
variation In other error tcoret. 

The ctattlcal true tcore model of measurement error hat provided 
ptychologi ttt wl th methods for estimating the reliability of composite 
measures, that Is, for estimating the proportion of observed variance 
that Is considered non-error variance. Traditionally, coefficient alpha 
(Cronbach, 1951) has been heavily relied upon as an estimate of Internal 
consistency reliability. Certainly, this has been the case In the 
evaluation of TAT n Ach i evement (e.g., Entwisle, 1972). Despite Its 
popularity, alpha may not always be the most appropriate coefficient for 
reliability estimation. It can be shown that unless measures 
demonstrate tau-equlvalence (or essentlal-tau-equ!valence) (see Lord and 
Novick, 19681 p.90) , coefficient alpha will under-estlmate the 
reliability of composite measures. Other, more general, reliability 
formulations are appropriate for estimating internal consistency 
reliability of composites when the assumption of tau-equlvalence cannot 
be met (Joreskog, 197B) . Recently developed methods of confirmatory 
factor analysis make It possible to evaluate the appropriateness of 
various measurement models for a particular set of measures (Joreskog, 
197^4). In this paper we demonstrate empirically the conclusion that 
others have derived analytically, that when a tau-equi valent measurement 
model' does not fit the data Well, coefficient alpha under-estimates the 
reliability of the composite measure. 

1 1 therefore becomes Important in re 1 I abl 1 i ty est I mat I on to 
consider the appropriateness of the measurement model underlying the 
coefficient used. The most general linear model for estimating the 
reliability of composite measures is Joreskog 's (1978) congeneric 
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measures model* A!! other modeli for reliability estimation represent 
restrictions on thli model* This model does not assume tau-equlvalence» 
nor does It place any constraints on the variances of errors, as In the 
case of parallel measures Jsee Alwin and Jackson, 1979) Joreskog, 1978). 
In the following discussion we demonstrate how such differing models for 
measurement error may be^ specified within Joreskog's '^analysis of 
covariance structures" framework and how the f i t of such models to 
empirical data can be examined. 

Consequences of modeling measurement error for validity ^^tlmation 

According to classical test theory, the correlation between two 
theoretical variables will be greater than, the correlation between two 
measured I nd I cators of those theoretical variables whenever random 
measurement error is present in the indicators. If the magnitude of 
random measurement error is known or can be estimated, the classical 
test theorist may disattenuate the correlation between the measured 
indicators. The d i sattenuated correlation Is treated as a standard for 
inferring the "true" relationship between the theoret i cal cons tructs 
(Lord and Novick, 1968, p.69) . In fact, the objective of determining 
such true relationships between theoret i ca T constructs originally 
prompted the development of test theory In psychology.' 

The pattern of relationships among measured indicators may look 
quite different from the pattern of relationships among those unmeasured 
constructs which determine variation in the measured indicators. 
Suppose constructs A and B are related to a third construct, C, to 
exactly the same extent. Given the principles of classical test theory, 
if the indicators of A are substantially more reliable than the 
indicators of B, we would observe greater covariation betvyeen the 

/ 
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indlcatori of A and C than betwaen the Indicators of B and C. Analysts 
who* consider only observed covariation might fall to conclude that- the 
constructs t A and B, are equally related to C. Alternatively, suppose 
that the true covariation batween constructs X and Z Is significantly 
greater than the true covariation between constructs Y and Zt but the 
Indicators of X are much lass reliable than the Indicators of Y. Given 
the principles of classical test theory, the observed relationship 
between* Indicators of X and Z would be attenuated to a greater extent 
than the observed relationship between indicators of Y and Z. The 
observed relationship between X and Z Indicators might be not 
significantly different from the observed relationship between y and Z 
Indicators. In this case, analysts who consider on]/^ observed 
covariation might fall to conclude' that the constructs . X /and Y, are 
differentially related to construct Z. 

Failure to move beyond simple consideration of covariation among 
observed variables has led critics of thematic apperceptive methods to 
misinterpret evidence relevant to the validity of the achievement motive 
as a theoretical construct. Entwfsle (1972), for Instance, acknowledged 
that 'TAT n Achievement Is much less , re 1 i able than TAT story length, 
without actually going the next step of Incorporating this knowledge 
Into the discriminant val Idatlon endeavor . Instead, the comparison was 
based on the magnitude of criterion validity coefficients, which 
confound measurement error and true covariation between constructs. We 
be 1 1 eve TAT n Achievement has been particularly vulnerable to such 
Interpretive errors, precisely because Its reliability tends to be much 
lower than that of lndi,cators of other constructs. Here we illustrate 
an approach that overcomes these errors;' 

- d 
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^ Method 

Sample 

Tha data raported hara orlglnata In a 1976 natlona-1 aurvay :dailgnad 
to obtain muUlpla maaturai of well-balng and role reaction! j(Varoff, 
Douvan, and Ku1ka» 1981). The reipondenti, drawn from randomly lampled 
houieholdt within the contlguoui United Statei, were choten to fepreient 
Americana over the age of 2). A random aubiample (N - 1 .'^98) was. 
administered the TAT near the outlet of each interview (aee Veroff, 
Depner, Kulka, and Douvan, 1980, for further deicrlptlon of vthe lurvey 
methods used) . ^ ' 

Measures 

Respondents told Imaglratlve stories to six pictures, selected to 
deplete variety of life jlituations with which respondents would 
presumably be familiar. Standard TAT prompts used in laboratory studies 
of motivation helped respondents develop a story plot for each picture. 
Stories were transcribed verbatim by the interviewers and later coded 
for achievement Imagery and Its subcategories using the content analysis 
procedures developed by McClelland, Atkinson, Clark, and Lowell (1958). 
Veroff et al. (1980) report satisfactory levels of Interscorer coding 
reliablMty. A total protocol for a respondent was Judged inadequate 
for assessing strength of the achievement motive if any story contained 
non-imaginative responses to more than one prompt or if more than three 
stories were missing one prompt each. On this batf I s 'adequate " protocols 
are available for 508 males and 700 females In the survey. Here we 
analyze data for the 508 males with adequate TAT protocols. 

For factor analytic reasons, only TAT items which were not 
seriously skewed (skewne^s < 2) were considered here/ The four pictures 
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yielding such reiprnse dlstrlbutloni for males are (given with their 
actual order of pretentatlon I n parentheses): 

(0 Two men (Inventors) In a shop work I ng at a 

machine. • 

(2) Four men seated at a table with coffee cups. 

One man Is writing on a sheet of paper. 
(4) Man at drafting table. 
^ (5) Conference group - seven men variously grouped 

around a conference table. 
The education constrilct considered here Is the amount of forma^l 
schooling each re$pondent\ received. The Indicator of this construct Is 
coded (1) If the respondents^ completedio'nly grade school, (2) If the 
respondent/ completed some\hlgh school, (3) If the respondent completed 
high schodl, W If the respondent completed some college, and (5) If 
the respondent completed collpge. 

F Ive I nd lea tors of y ntr Insle satisfaction at 
from the I976 "^survey . These Include; 

o (1) How r.iuch satisfaction (hive you gotten/would you 
^ get/did you get) out of work at a jobf [Coded 
(1) No satisfaction, (2) LI ttle satisfaction, 
(3) Some satisfaction, (k) Great satisfaction.]; 
(2) Here are some things that might describe a 
person^ s Job. Please tell me true each Is 
of your (main) job, uslr^g one of the answers on 
this card. The flrst\one isj The worl( Is 
Interesting. [Coded (1) h^ot at all true, (2) 



work were selected 
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Not^ very true, (3) Somewhit true, W Ver^- 
true.] ; . , 

(3) [This indicator was preceded by the Berne preface 
I uted for Indicator 2.] I am given a chance to 
do the thingt I do beat. [Coded the aame at 
Indicator 2.] j 
{k) How much (hai/would/did) work at a^ lob (led/ 
lead) to (the moit Important value) In your 
life? [The Interviewer tubtt I tuted that ve 1 ue 
which the respondent had Just lelected as most 
Important from a list of rjiir^e I'lfe values 
proposed by Rokeach (1973)4^ [Coded (1) Very 
little, (2) A little, (3) Some, (^4) A lot* (5) A 
• great deal.*-^^ 
(5) Taking Into consideration all .the things about 
your Job, how satisfied or dissatisfied are you 
R^lth It? [Coded (1) ^ieutral, emblvalent, or 
[ssatlsf led, (2) Satisfied, (3) Very 
satl>f led.] . 



Except for the lastTtem, these indicators used a forced, chol ce response 
format. Since three of these measures wire asked only of currently 
employed respondents, the number of cases for analyses Involving work 
satisfaction Indicators is ^13* 
Specification of measurement ipodels 

/ In order to select a measurement* model , we begin with a varlance- 

covar lance matrix calculated from the fpur non-skewed TAT n Achievement 
ItemsJ We attempt to identify an hypothesized covarlance matrix, X , In 



Li 
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which elemtnta 4re equal to their corresponding elements In the observed 
^i/erTa^ 2 Is given by the factor analytic 



model 

where A It a matrix of factor pattern coef f I c I ents re) at I ng the e 

measured variables to unmeasured common factors; <¥> specifies the 

— - 2 
variance-.covar lance matrix relating the k common factors; and ^ Is a 

var I ance-covar I ance matr I X of £ disturbance terms (Joreskog,- I969, 

1974). We have assumed, without loss of gcsnerallty, that all factors 

are centered (lie., have mean zero) In the analyses that follow. in 

analysing var I ance-cover I ance matrices of TAT data with COFAMM (Joreskog 

and Sorbom, 1976), we have specified four models: 

(1) the null model Implies k £• ^ ■ JL' * a 

diagonal matrix' with certain elements 

2 2 
constrained to be .equal ; and - 0; 

\ 

(2) the paral lei measures model implies that 
Indicators must be unlvocal (I.e.., load only on 
one factor), and that elements In A are 

' constrained to be equal for all Indicators of a 
factorf diagonal elements of. * are set at KO, 
and of f*-di agonal elements of ^ arc free (i.e., 
estimated); and ^ Is diagonal with elements 



constrained to b^e^^equal witpin each set of 
Indicators of a factor ;\ 
(3) the tau-equi val ent measures modeL Impl ies that 



Indicators must be unlvocal, apd / that element-s^^ 
In A are constrained to be equal for all 
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Indicators of a factor; diagonal alements in <T> 

are set at 1.0, and of f^-di agonal elements In 'I' 
2 

are free; and is diagonal and free; 
(4) the congeneric measures model implies that 
indicators must be univocal* and that elements 
in A are estimated for all indicators of a 
^ factor; diagonal elements In «r> are set at 1 .0, 
and off-diagonal elements i n are free; and 
is diagonal and fr^ee. 
in general, in order to identify uniquely any of these models it is 
necessary to impose k_ restrictions on the elements in A and/or cf> . In 
most cases this may be accomplished by (1) fixing one element In each 
column'of A to unity and k - 1 elements in each column to zero, or (2) 
fixing the diagonal of <t> to unities and k - 1 elements in each column 
of A to zero. These are necessary but not! sufficient conditions, for 
Identification (Joreskog, 1979) • 

The goodness of fit of various measurement models we specify may be 
evaluated statistically using the methods of max 1 mum-* 1 1 kel 1 hood 
confirmatory factor analysis. The likelihood-ratio statistic is 
distribute^ as with g degrees of freedom, and It Is possible to test 
the hypothesis that a particular s^^ecl f i cat I on of values for the free 
parameters in A, ^ , and >1^^^ill reproduce the population variance- 
;ccvarlance matrix, 2 . The degrees of freedom assoc i ated wi th a model 
is In general equal to .5 E (fi + 0 - t, where t is the number of 
par^mettrs of the model that are unconstrained. A nonsignificant £- 
value associated with a model implies that the hypothesized factor 
analytic model generates a matrix of estimated variances and covariances 
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which does not ^differ • i gni f I cant 1 y from the observed varlence- 
coverlence matrix. However, one of the problems with the simple 
application of these statistical inference tools Is that the value of X 
Is Influenced by" sample size, in sufficiently large samples the value- 
of may cause one to reject non-trivial models on statistical grounds 

(Joreskog, I969) . Therefore, as suggested by Joresicog (1978) • we also 

V 2 2 

compare changes In the ratNo of X to degrees of freedom (X /df) as we 

compare models. As the X^/]^ ratio of a model approaches 1.0, fit* 

Improves. In add 1 1 1 on we make use of Bentler and Bonett's (I98O) normed 

fit Index to scale the Improvements in fit In standardized units. 

Resul ts 

Selecting a measurement model for TAT n Achievement 

Table 1 displays measures of fit for true score measurement models 

and their appropriate null model, when these models are applied to the 

observefl var 1 ance-covar I ance matrix of TAT n Achievement indicators for 

1976 men. It Is evident that the null model and the parallel measures 

model may be rejected since their X^/df ratios are substantially greater 

than 1.0, and their g-values indicate extremely poor fit. The tau- 

kqulvalent measures model provides a h I ghl y s I gni f 1 cant Improvement in 

fit over the parallel measures model (difference X (3) ■ 36.8^0, £< 

.001).; Yhe congeneric measures model does not Improve significantly upon 

2 

the tau-equi valent measures model (difference X (3) ■ 3.792i £ > -250). 

2 -1 
Finally, we Infer from Its X /df ratio and £-ya1ue that ^ the tau- 

equivalent measures model fits the data very well. These observations 

lead us to prefer the tau-iequi valent measures model in accounting for 

the relationships among the four Indicators of the achievement motive 

for men In the sample. 
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Table 1 here 



Since these true score models may be arranged hierarchically, we 
might also determine ^a preferred model from en examination of normed 
incremental fit indices (Bentlar and Bonatt, igBO, p,599) * The Bantler- 
Bonett normed fit indices;' calculated from Table 1, are ^12" i^92, ^ 23 

- .i*52, and S - .0^46. These incremental indices showimaller 

- 34 ^ 

improvements In fit, as the hierarchical step-up comparison of two 
models proceeds. The overall normed fit Index for our preferred tfiu- 
equivalent measures model is - .9^^. an acceptable value by Bentler 
and Bonett*s standards (196O, p.600) • / 

Figure 1 displays parameter estimates of the preferred measurement 
model for 1976 male TAT n Achievement. A reliability coefficient for 
the indicators of the achievement motive in this tau-cqu ! va1 ent measures 
model (Alwin and Jackson, 1979. P-96) is .356- This Internal 
consistency coefficient is quite similar in magnitude to the median 
Internal consistency reliabHitles reported in psychometric revTews pf 
TAT n Achievement (Entwi^^ 1972; Fineman, 1977). Unlike previous 

reliability estimates, however, our estimate is derived from an 

\ 

explicit, empirically justified measurement model. 
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F igure 1 here j 



Relationships among TAT n Achievement, TAT story -length-, and /education 

The zero-order correlation between the unweighted sum of .the four 
non-skewed TAT n Achievement items and education for 1976 male 
respondents is .151 (df- 1, 503j £- .0007); the zero-order correlation 
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betwttn the unweighted sum of the ttory length measures for these same 

four TAT Items and education Is .193 (df - 1 • 503; E < -0001) . In a 

multiple regression analysis with education as the dependent variable, 

the standardized partial regress Ion. coeff 1 c lents for TAT n Achievement 

and story length ere .100 (fi - .02^4) and .157 (fi " .000**), respectively. 

This pattern of criterion validity coefficients replicates Entwlsle's 

(1972) flndfng that story length Is nom 1 nal 1 y the more potent predictor 

of an educational attainment criterion. 

Believing the causal .relationships among TAT n Achievement,- TAT 

story length, and education to be ^^ambiguous In our cross-sectional 

survey design, we proceed to determine the correlation coefficients 

(rather than structural equation parameters) relating these three 

constructs, now taking measurement models for the constructs Into 

consideration. Using confirmatory factor analysis, we posit several 

models that attempt to reproduce the observed var i ance-covar I ance matrix 

relating the four indicators of n Ach I evement , four Indicators of story 

length, and the single Indicator of education. 

Table 2 displays measures of -fit for these models. Model 1 

specTfles the^null model In which all variance associated with each 

indicator Is random error variance, and error variances are constrained 

to be equal f or I nd I cator s of the achievement motive and for Indicators 

of story length. As shown In Table 2, this hypotheslied nul 1 model 

2 

Should be rejected since Its X /df Is substantially greater than 1.0, 
and Its g-value Is h I gh 1 y s I gn I f I cant . MojieMs 2 through 6 specify true 
score measurement models for the n Achievement and story length factors 
which are hypothesized to underlie the observed Indicators. Since only 
one indicator of the education factor Is Included In the analysis, and 
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therefore meaiurement models. for that factor cannot be estimated from 
these data, we Impose the assumption that 6.7 percent of , the variance In 
the Indicator Is error, variance, based on prior research which has 
estimated the reliability of such^|in Indicator (Wheaton, Muthen, Alwln, 
and Summers, 1977i p. 1 12) . 



Table 2 here 

_ / 



Contrasts among hierarchically nested measurement models lead us to 

po^lt Model k as the preferred model for these data.^ Model ^ specifies 
tha^t^ the n Achievement Indicators are congeneric, and the story length 
Indicators are tau-equlvalent. The model fits the data very well, as ^ 
shown by Its X^/df equal to 1 .08A and Its nons I gnl f I cant £-value . . 
Bentler and Bonett ' s* normed fit Index for Model k In Table 2 Is .980. 
Model A Is preferred because It Is a significant Improvement upon more 
restrictive models (Models 5 and 6) and Is not significantly Improved 

V 

upon by a less restrictive mode'l (Model 2) . 

In this preferred model (see Figure 2), we note that the 
correlation between n Achievement and education, which has been 
dl sattenuated for the unreliability of the Indicators of those 
constructs, is .256 (SE ■ .073) • The d I sattenuated validity correlation 
between the story length and education factors Is .210 (SE - .0A6) . The 
relative magnitude of these ''true'* validity coefficients implies that 
TAT n Achievement is In fact as potent as story length in predicting 
educational attainment. The relative magnitude of the "true" validity 
coefficients Is the reverse of the relative magnitude of the "observed'' 
validity coefficients. These results cal 1 I nto question the conclusions 
drawn by Entwisle (1972) . 

■16 
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, i Figure 2 here 

We note f\na)ly that the preferred measurement model for the 
achievement motive Indicators Is Influenced by the sample of Indlca^tors 
of other constructs selected for analysis. Given Indicators of the 
achievement motjve, story length, and education, we novy^ prefer a 
congeneric measures model for the achievement motive Indicators.*. Given 
only Indicators of the achievement motive earlier, we preferred"^ tau- 
equivalent measures model. We Interpret this change of preferred mo^s 
to Imply that these I ndlcators of the achievement motive are 
differentially related to the Indicators of other constructs, although 
they are equally related to the achievement motive construct Itself. 
The rel labi llty coefficient derived from the congeneric measures model 

; / - * 

of the achleveme^ht motive Indicators in Figure 2 is .383t similar to 
that obtained LarWT In Figure 1.^ The reliability coefficient derived 
from, the tau-equi valent measures model of TAT story length in Figure 2 
is .gU. The fact that the Indicators of this construct are so much 
more rellaible than Indicators of the achievement motive construct 
Implies that observed validity coefficients of the story length 
I ndlcators wl 1 1 be changed less when \ dl sattenuated for measurement 
error, compared to V observed val Idlty coefficients of the achievement 
motive Indicators. 



The discriminant validity of TAT n Achievement and story length 

Tabia 3 displays fit Indices for several confirmatory factor- 
analytic models which attempt to reproduce t)Vi^ observed varlance- 
covarlance matrix of relationships among Indicators of the .achievement 
motive, story length, and I ntr I nsi c wor k satisfaction.^ Mt Is clear 
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from its fit indices that the null model may be rejected. Systematic 
contrasts between measurement models for the three constructs lead us to 
prefer Model 4» which hypothesizes that the indicators of the 
achievement motive and work satisfaction are congeneric » and the 
Indicators of story length are tau*equ ivalent» consistent with the 
findings presented in Table 2. Model Wits reasonably well (X /df > 
1.362; £ .0278). Bentler and Bonett's normed fit Index for Model k in 
Table -3 it •953* Model k is a significant Improvement upon more 
restrictive models (Models 5 and 6) and is not significantly improved 
upon by a less restrictive model (Model 2) . 



Tabl e 3 here 

Th^ correlations among the three theoretical variables are 
displayed In "Figure 3. The estimated correlation between Lhc 
achievement motive construct and the work satisfaction construct is .277 
(SE * .086); the estimated correlation between the story length 
construct and the work satisfaction construct is -.011 (SE • .057)* 
These, correlations are significantly different {£ < .00.2), demonstrating 
the discriminant validity of the achievement motive construct and the 
story length construct with respect to the work satisfaction construct. 

F 1 gure 3 here 



Cone 1 us ions 

Through the systematic application of principles of classical test 
theory, we have provided evidence here for the construct validity of the 
achievement motive. Contrary to the claims of Entwisle (1972), our 

. ■ ■ 
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analyiis luggesti that TAT n Achievement and TAT story length are not 
equivalent construct!.' The achievement motive Is expected to Influence 
behavior In an axtenilve set of* BltuatlonB where individual! compete 
with a itindard of excellence. TAT itory length reflect! ■more 
local lied conatruct, verbal ichievement. 

Wa have di f f arent I ated .meaiured var I ablea used ai Indicator! of a 
theoretical con!truct from the unmea!Ured theoretical construct I tse 1 f . 
The problem of eatlmating re 1 at I on!h I ps between a theoretical ^variable 
and mea!Ured Indicator! of that theoretical variable* based on sample 
obaervatlon! of covariation among the IndlcatorSi is conceived here as a 
reliability lasuel .The problem of est Imat I ng .the re 1 at I onsh i p between 
two (or more) theoretical variables, based on sample db'sery'ati'fins of 
covariation between Indicators of each construct, "Is conceived as a 
validity Issue. Critics of thematic apperceptive methods have failed to 
distinguish between criterion validity coefficients and validity 
coefficients that show the relationships between theoretical constructs 
(I.e., validity coefficients which' have betn >d I sattenuated for 
measurement error In Indlcatprs of the constructs),' As a consequence of 
this confounding, critics of TAT- n A;chl evement have reached particularly 
mistaken conclusions, precisely because random measurement error Is so 
much more substantial in thematic apperceptive indicators of the 
achievement motive than In Indicators of other constructs. 

By predicating, our analysis of the validity of the achievement 
motive on explicit true score measurement models, we reach estimates of 
relationships among theoretical constructs which are more theoretically 
Justifiable than those which have been reached by critics of thematic 
apperceptive methods. Although we believe our approach improves the 
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Interpretation of evidence bearing on theoretical propositions, we make 
no claims for Improving the practical utility of the TAT, for Instance, 
as an Instrument for selecting Individuals into motivational 
Intervention programs. Entwisle^t (1972) cr I'ticlsms of TAT 
n Achievement are in part directed at such practical limitations of 
unrellfble measures. We .would argue simply that questions concerning 
the theoretical status of TAT n Achievement need not be answered I n the 
same way as questions concerning Its practical status. 

Future research may refute our choice of a model that specifies 
that a large proportion of the variance In TAT n Achievement Items is 
due to random measurement error. Motivational theorists (Atkinson, 
Bongort, and Price, 1977; Reuman, In press) have hypothesized that this 
"random error variance" Is in fact systematically related to 
motivational dynamics. Our present measurement framework does not 
address such hypotheses directly. We can at best assert that we have no 
evidence here that true score measurement models should not be applied. 
As long as many researchers continue to Interpret low correlations among 
Indicators of a construct as a sign of the presence of ^random 
measurement error, we would at least recommend that validity/ 
coefficients be estimated In conjunction with empirically Justifiable 

measurement models for theoretical constructs. • j 

■j 
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Footnotes 

'we believe an analysis of the var I ance-covar 1 ance matrix Is 
warranted over an analysis of the correlation matrix for two reasons. 
First. It Is logically Impossible to test/fhe fit of an appropriate null 
model for the parallel measures model, when a matrix of correlations 
serves, as the empirical basis for the test. Such a null model 
hypothesizes that all I tern variance Is mndom error variance and that 
the magnitude of this error variance Is equal for all Items. The 
variances of all Items have neceissar I 1 y been constra I ned . to be equal In' 
a correlation matrix, however, so this equality constraint cannot be^ 
examined when a correlation matrix Is used. For models where this type 
of constraint Is not preseiHt, It does not matter whether one analyzes a 
correlation or covariance matrix. The results, except for scale,' are 
identical. Second, as long as the Indicators of a construct use the 
same'metric, we believe the i tem var I ances are inherently Informative 
and should not be standardized. 

^To specify a null .^model that Is appropriate for* (I.e., more 
restrictive than) the parallel measur'es modal • we must constrain error 
variances to be equal for, all elements in the diagonal of ^ that 
correspond to Indicators of a singje construct. If a parallel measures 
model cannot be meaningfully specified In principle, as Is the case when 
the Indicators cf a construct use different response metrics, an 
appropriate nul 1 model does not constrain error variances to be equal. 

^Slnce'an interviewer may have Inaccurately transcribed a TAT 
story, and the story would have been coded both for the achievement 
motive and for story length, we hypothesized that some correlated error^ 

may be present^ for an indicator of motive strength and a corresponding 

s 
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Indlcitbr of itory length which were bated on the tame TAT ttory 
(Campbell and FItke, 1959J Alwin, 197^). Even though the correlated 
error attumption it somewhat compelling on ponceptual grounds, we found 
oo'TTgnTfTcant improvement in fit for several models which"^ Included this 
atsumptioni in addition to the astumptions of Models 2 through 6. 

^The. slightly larger reliability estimate derived from the 
congeneric meaturet model in Figure 2, compared to that derived from the 
tau-equlyal ent meaturet model In Figure 1, is consistent with 
expectationt bated on clatslcal test theory. 

^As In the ^previous section, we found that the assumption of 
correlated error variance does not significantly Improve the fit of 
model s i n Table 3. 

^Since the Indicators of work satisfaction do not use the same 
metric, measurement models more restrictive than the congener I c measures 
model are Inappropriate. The parallel measures and the tau-equi valent 
measures models, which assume equal true score variance in all 

Indicators of a construct, are only pla,usible if the indicators have at 

least been measured on the tame tcale. 
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"Table 1 

Fit Indlcei for Meaaurement Models 
of 1976 Male TAT n Achievement 



Model rr ■ df XVdf g-value 



1. Null model / B\ .^77 9 9-053' <.0001 

2. Psral lei measures model ^41.392 6 5-i7'* <.000I 

3. Tau-equI valcnt measures model '♦•552 5 .9iO .'♦730 
k. Congeneric measures model .76O 2 . 38O .6838 



Table 2 

Fit Indices for Models of 1976 Hale 
n Achievement, Story Length, and Education 





Model 


x2 


a 


X^/df 


£-va 1 ue 


1 . 


Nul 1 mode 1 




1558. 27'* 


142 


37. 102 


<.0001 




Measurement 


Models for 










n 


Ach 1 evement 


Story Lenath 






\ 




2, 


C 


C 


27.367 


26" 


1 .052 


.3903 


3. 


T 


c 


i» 1 . 396 


29 


1 .1427 


.0636 


<». 


C 


T 


31. '♦38 


29 


1 .o8'4 


.3'«51 


5. 


T 


T 


145. 69'* 


32 


1.i428 


.0553 


6. 


C 


P, 


14 '4.098 


32 


1.378 


.0755 



Note > C - congeneric measures; T » tau-equ 1 va I ent measures; 
P » parallel measures. 
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Table 3 

Fit Indices for Models of 1976 Hale 
n Achievement. Story Length, and Work Satisfaction 





Mode 1 




if. 


X^/df £-value 



1 . 


Nul 1 model 




1890.7 1 1 




22.508 


<.0001 




Measurement 


Mode 1 s for 












Story 


Work 












n Ach Length 


Satisfaction 










2. 


C C 


C 


85.288 


62 


1.376 


.0266 


3- 


T C 


C 


98.601 


65 


1.517 


.001*5 


k. 


C T 


c 


88.5'* 3 


65 


1 .362 


.0278 


5. 


T T 


c 
c 


102.05'* 


6a 


1 .501 


.001*7 


6. 


C P 


98.809 


66 


1.1.53 


.0087 



Note * C * congeneric measures; T « tau-equi valent measures; 
P « parallel measures. 
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Figure 1 

Preferred Measurement Model for 1976 Male TAT n Achievement 




Note Upper coefficients are COFAMM parameter estimates when only the 
■factor is standardized. Lower coefficients are standardized 
with respect to the variance of the indicators as well, ^tarred 
•values were set or constrained to be equal in the initial so.l'ution. 
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Figure 2" 

Preferred 3-Factor Model for Representing the Covariances 
Among Indicators of TAT n Achievement, TAT Story Length, and Education 
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Pic4 
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2.147 

.944 
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Pic5 



T 
2.588 
.928 



Note. Upper coefficients are COFAMM parameter estimates when only the 
factor is standardized. Lower coefficients are standardized 
with respect to the variance of the indicators as well. .Starred 
values were set or constrained to be equal in the initial solution. 
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Figure 3 

Preferred 3-Feictor Model for Representing the Covariances 
Among Indicators of TAT n Achieveirient , TAT Story Length, 
and Work Satisfaction 
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Note. 



Upper coefficients are COFAMM parameter estimates when only the 
factor is standardized. Lower coefficients are standardized 
with respect to the variance of the indicator^ as well. Starred 
values were set or constrained to be equal in the initial solution, 



